Differences in the audio-visual detection of word prominence from Japanese and English speakers

نویسندگان

  • Martin Heckmann
  • Keisuke Nakamura
  • Kazuhiro Nakadai
چکیده

We have previously shown that for English speakers information on the mouth shape of a speaker is a powerful feature for the machine based discrimination of prominent from nonprominent words. In this paper we extend our analysis to data from Japanese speakers. We compare the discrimination performance of the different acoustic and visual features we extract for the two languages. This comparison shows a much wider variability in discrimination scores for the different speakers and the different features in the English dataset than in the Japanese dataset. Despite previous hints that visual speech and word prominence perception by Japanese listeners can yield inferior performance compared to English listeners we see that our discrimination scores are high and very similar for the English and Japanese speakers which indicates that at least the speakers signal prominence with a similar level of consistency in both languages.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Analysis of Word Duration in Native Speakers and Japanese Speakers of English

An analysis of word duration in English sentences uttered by native speakers of Japanese is made, in which the difference in prosodic patterns between the English and Japanese languages is taken into account. The durations of Japanese speakers are compared with those of English speakers in regard to a percentage distribution of an individual word relative to all words in a sentence. The results...

متن کامل

Differences between Speakers in Audio-visual Classification of Word Prominence

We show how the audio-visual discrimination performance of prominent from non-prominent words based on an SVM classifier varies from speaker to speaker. We collected data in an experiment where users were interacting via speech in a small game, designed as a Wizard-of-Oz experiment, with a computer. Following misunderstandings of one single word of the system, users were instructed to correct t...

متن کامل

Steps Towards More Natural Human-Machine Interaction via Audio-Visual Word Prominence Detection

We investigate how word prominence can be detected from the acoustic signal and movements of the speaker’s head and mouth. Our research is based on a corpus with 12 English speakers which contains in addition to the speech signal also videos of the talker’s head. To extract the word prominence information we use on one hand functionals calculated on the features and on the other hand Functional...

متن کامل

Relative Importance in English and Persian: Thematization or Tonic Prominence?

There are two common ways to assign relative importance in spoken language: tonic prominence and thematization. The former is expressing the main points of information units in speech (Halliday, 1994), and the latter is putting an element at the beginning of a clause. This study explores how relative importance is realized in English and Persian. It also investigates how advanced Persian learne...

متن کامل

Perception of prosodic prominence and boundaries by L1 and L2 speakers of English

This study provides a direct comparison of boundary and prominence perception strategies between Japanese EFL learners and native speakers of English using the Rapid Prosody Transcription (RPT) method. Although RPT experiments are available for both native English speakers [1], [2], [3] and Japanese EFL learners [4], a direct comparison of the available data is problematic as the stimuli sets u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013